An Estimation of Missing Values by Modified Mixed Kernels

نویسنده

  • M. Hemalatha
چکیده

----In statistical practices, difficulties of missing data are universal. Several techniques are used to handle this dilemma of missing data. They include both old approaches, which require only a small amount of mathematical computations and new approaches, which require additional difficult computations that are ever easier for social work researchers to carry out the statistical programming softwares. In the existing system, there is a novel setting of missing data imputation, i.e. imputing in mixed-attribute data sets. This system offers two consistent estimators for discrete and continuously missing target values, correspondingly. After that a mixture-kernel based iterative estimator is offered to impute mixed-attribute data sets. In this method, the local kernel and global kernel are used and linear combination of these mixed kernels is used. Nevertheless, the accuracy of the system is decreased with the large number of data samples. Unquestionably it will degrade the performance of the system. To improve the performance and to increase the accuracy of the system we proposed three approaches. First we introduce the local kernal RBF using KL divergence, secondly we introduce the global kernal polynomial using probability distribution and finally mixed kernels in piece level combination instead of linear combination. From the experimental result we can obtain that the proposed system is much more effective than the existing system. The performance also is shown to have improved in this proposed system. Keywords----Missing data imputation, Kernel Function Selection, Linear Mixture Kernel Function, RBF kernel, Polynomial kernel and Statistical Imputation for Missing Data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance evaluation of different estimation methods for missing rainfall data

There are numerous methods to estimate missing values of which some are used depending on the data type and regional climatic characteristics. In this research, part of the monthly precipitation data in Sarab synoptic station, east Azerbaijan province, Iran was randomly considered missing values. In order to study the effectiveness of various methods to estimate missing data, by seven classic s...

متن کامل

Ensemble Kernel Learning Model for Prediction of Time Series Based on the Support Vector Regression and Meta Heuristic Search

In this paper, a method for predicting time series is presented. Time series prediction is a process which predicted future system values based on information obtained from past and present data points. Time series prediction models are widely used in various fields of engineering, economics, etc. The main purpose of using different models for time series prediction is to make the forecast with...

متن کامل

Weighted Modified First Order Regression Procedures for Estimation in Linear Models with Missing X-Observations

This paper considers the estimation of coe cients in a linear regression model with missing obser vations in the independent variables and introduces a modi cation of the standard rst order regression method for imputation of missing values The modi cation provides stochastic values for imputation and as an extension makes use of the principle of weighted mixed regression The proposed proce dur...

متن کامل

Bayesian Inference for Spatial Beta Generalized Linear Mixed Models

In some applications, the response variable assumes values in the unit interval. The standard linear regression model is not appropriate for modelling this type of data because the normality assumption is not met. Alternatively, the beta regression model has been introduced to analyze such observations. A beta distribution represents a flexible density family on (0, 1) interval that covers symm...

متن کامل

Ensemble Learning with Supervised Kernels

Kernel-based methods have outstanding performance on many machine learning and pattern recognition tasks. However, they are sensitive to kernel selection, they may have low tolerance to noise, and they can not deal with mixed-type or missing data. We propose to derive a novel kernel from an ensemble of decision trees. This leads to kernel methods that naturally handle noisy and heterogeneous da...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014